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APPARATUS AND METHOD FOR REMAINDER CALCULATION USING 
SHORT APPROXIMATE FLOATING-POINT QUOTIENT 



Field of the Invention 

The present invention relates to implementing efficient floating-point operations in 
a computer system. More particularly, the present invention related to creating computer- 
based lookup tables responsive to a set of accuracy requirements. 

> 

Background of the Invention 

As will be described in detail below, the calculation of elementary mathematical 
functions, such as a two argument arc tangent (tan 1 (A/B)), in a computer system may best 
be efficiently performed using a remainder from a floating-point quotient and lookup 
tables. Known prior art techniques do not typically create such lookup tables to efficiently 
return results of a specified accuracy. The present invention, however, is directed at using a 
floating-point remainder (%), derived from a floating-point quotient (Qjp), to create lookup 
tables based on an approximate remainder that "exactly" represents the underlying 
arithmetic value within the specified accuracy. The created lookup tables are thus 
efficiently tailored to the intended use, with no more and no less accuracy than will be 
needed in the particular application. Such efficiency can result in both faster calculations 
and more compact lookup tables. 

The remainder in a division process is an important entity in many floating-point 

calculations. In fact, the remainder calculation is considered so basic an operation that 

Express Mail No.: EL500380483US -2- 

Docket No.: 2207112617 

29986_5.DOC 



IEEE Standard 754 mandates such an operation be supported. IEEE Std. 754 - 1985, 
reaffirmed 1990, Standard for Binary Floating-Point Arithmetic. Unfortunately, most 
hardware implementations of the IEEE remainder are slow, with resources often dedicated 
to other operations such as basic floating-point add, subtract, and multiply, or others such as 
single-instruction-multiple-data operations. Moreover, in some common situations, the 
definition of the IEEE remainder is not "naturally" applicable, as the IEEE quotient is an 
integer quotient, whereas many applications require the remainder with a floating-point 
quotient. 
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Brief Description of the Drawings 

0004 Figure 1 is a flow chart of the implementation of a generalized floating-point 
remainder procedure used to create lookup tables in accordance with an embodiment of the 
present invention. 

0005 Figure 2 is a flow chart of an embodiment of the present invention using a design by 
rounding technique. 

p006 Figure 3 is a flow chart of an embodiment of the present invention using a design by 

IP quotient technique. 

98 

§3)007 Figure 4 is a flow chart for the creation of reciprocal tables using a design by 

rounding technique in accordance with an embodiment of the present invention. 



Q 
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Detailed Description 

0008 The present invention uses a floating-point quotient (Qf p ) instead of the more 

standard IEEE integer quotient (£),„/) to create lookup tables for efficient computer-based 
calculations. An approximation of a floating-point remainder (R^) can be derived from Q/ p 
andR/p can be used to create lookup tables which are created to the precise accuracy needed 
for a particular implementation. 
N9009 More specifically, the IEEE remainder R/eee of a dividend^ and divisor B is defined 



HI 



as: 

MEEE = A-Q int xB 

M where Q int is the quotient A/B rounded to the closest integer value. In many applications, 

jil however, the following floating-point remainder is more useful: 

fli 

in R fp =A --Qfi> xB 

13 

fl| where Qr P ~AIB and Q/ p has a fixed number of significant bits L. That is, Qf p is of the 

form: 

Qjp = 2 l x l.qiq 2 --.qL-i 

for a fixed L. 

0010 There are three general processes that are needed before actually creating the lookup 
table in accordance with the present invention, obtaining an approximate quotient (W), 
rounding that quotient, and computing the floating-point remainder 

0011 For the approximate quotient computation, obtain a floating-point value W: 

W~A/B. 
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Next, use some rounding method to obtain Qf P from W, thus: 

Qjp = rounding (W, direction L).) 
There are at least four rounding directions easily accommodated by the present invention: 
nearest, midpoint, inwards, and outwards, of which the first two are the most common. 
Finally, for the computation, compute %: 

Rfp = A-Qf P x B. 

!3>012 An important property in this remainder definition is that R/ P must be obtainable 

B exactly, that is, A - Q fp x B is a value representable exactly in the underlying floating-point 

05 

fi format. In order for this exactness property to hold, we must satisfy certain constraints 

^ relating the accuracy of the approximate quotient, Qfp the quotient length L, as well as the 

y rounding direction. In a typical scenario, one or more design parameters are fixed, such as 

jjj rounding length and direction, and the other parameter(s), in this case, the accuracy 

o 

iU requirement of the approximate quotient, must be determined to satisfy the intricate 

constraints. 

0013 With the present invention, in addition to creating lookup tables, we can check the 

design of a general floating-point quotient based remainder instruction with different design 
constraints. For example, one could calculate the accuracy required for the approximate 
quotient to design a floating-point quotient based remainder algorithm that uses a 
"nearest" rounding to 5 significant bits. Alternatively, one could find how many significant 
bits can be used as the "midpoint" quotient, given an approximate quotient with a given 
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accuracy. Applications for this remainder operation include, but are not limited to, the 
computation of elementary functions, IEEE division, and square root operations. 
0014 We discuss the present invention in comparison with both the formulation of the 

remainder to the IEEE remainder definition, in certain applications, and with a "manual" 
approach, where a designer will typically carry out analysis for each specific set of 
constrained parameters. 
14)015 First, a discussion of the floating-point quotient over integer quotient. When 

computing functions with the quotient of arguments f (A/B) 9 a table-lookup technique, 

may be useful to obtain a short, say, 4-significant-bit, approximation Q to A/B and exploit 

ffk 

%J specific relationship: 

I f(A/B)=f(Q) + g((A/B)-Q). 



Mi 

60 



12 



W)016 |(^/5)-g|isoftensmallerthan|^/5| by a factor of 1/32. This "shift off ' 

property is important as the rounding error in computing g is relatively insignificant. 
Clearly, (A I B) - Q = (A - QB) I B , which includes our definition of the remainder If 
one were to instead use an IEEE remainder, the alternative is to restricts and B so that 
| A | < | B | and find the remainder: 

Rieee=\6x A-QxB. 
or, f(A/B)=f(Q/16)+g((l6A/B-Q)/l6). 
0017 There are two drawbacks to using the Rieee approach. First, we have to perform 
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some preprocessing steps such as restricting ourselves to | A |< | B | and scale up A by 1 6. 
Second, and more important, is that the argument to g is no longer always of a factor of 32 
or so smaller than \A/B\. In fact, when | A/B \ ~ 1/16, the argument tog may be only 
smaller than | A/B | by a factor of 2. 

Another situation, which is getting more common in modern computer architectures, 
is that the IEEE division is typically implemented in software. In this situation, a fast 
floating-point remainder (Rf p ) based on, a 4-significant-bit floating-point remainder can be 
used to implement a 4-bit at a time division algorithm. However, with an IEEE type of 
remainder calculation, every step of the iteration requires explicit scaling. 

In the IEEE remainder, the integer quotient is defined as an infinitely precise 
quotient (A/B) rounded to the nearest integer value. This definition is extremely tight and 
thus most IEEE remainder operations are implemented on top of IEEE division operations. 
Worse still, in hardware where an aggressive division algorithm is used, a separate radix-2 
restoring algorithm (a simple but slow available algorithm) is typically used to calculate the 
IEEE remainder. That the remainder can be represented, and thus computable exactly, even 
if the short quotient is "inexact" is crucial. The present invention's use of an approximate 
quotient allows much more flexibility, such as the use of built-in approximate reciprocal or 
other ad-hoc approaches to calculate approximate, and even possibly biased, quotients. 

There are also advantages to using different rounding directions. In contrast, the 

IEEE remainder defines the integer quotient to be the round-to-nearest integer value of the 

infinitely precise quotient. The present invention's formulation allows for multiple 
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rounding directions that may be better suited for certain situations. 

The present invention is based on a detailed general error analysis allow us to 
explore, in an automated manner, the vast design space. Niche areas in the design space can 
often be missed when not explored exhaustively via machine automation. Furthermore, 
repeated manual analysis is error prone, especially in some subtle "corner" cases. 

The general framework of three of the building blocks of the remainder (Rjp) 
operation is as follows: 



1. 



where, 



and 



Approximate quotient computation: Obtain a floating-point value 



B B. 



A = c>Ax2 m xa, 
B=o B x2 n xb, 
Oa> o b = ±1, 
\<a y b <2. 



form: 



Rounding: Use some rounding method to obtain Qj P from W. Wis of the 
W — a w 2 k x l.wiW2*..w p — j 
where a w = ± 1 and p is the native floating-point precision of the computer. For example, 
p = 24 single precision and p = 53 for double precision under the IEEE standard. We 
obtain: 

Qf p = rounding ( W> direction, L)= W + a w x 2 k xfi 
where the exponent k is the exponent of W. 
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Four common rounding methods that are easily accommodated by embodiments of 
the present invention, nearest, midpoint, inwards, and outwards. However, those of ordinary 
skill in the art will recognize that the present invention could easily be adapted to virtually 
any rounding method and is not intended to be limited to any particular rounding method. 
The definitions of the rounding modes can be seen in terms of the range of values in terms 
of fiminmA yS ^ , and where the value of jSmin and J3 max are tabulated below: 

max 



direction 


j3 nun 


j5 max 


nearest 


-l/2 L 


1/2 1 


midpoint 


-2/2 L 


2/2 L 


inwards 


-111 1 


0 


outwards 


0 


2/2 L 



3 . Remainder computation, compute R: 

R=A-QxB, 

using any standard technique. For example, when computed without the use of extra 
precision, we can splits into Bm and5/ 0 such that Q x Bhi and Q x Bio are computable 
without error. The remainder is simply obtained as (A - Q xBhi) ~QxBi 0 . Using an 
architecture with a multiply-accumulate instruction the remainder may be obtained via one 
multiply-accumulate instruction. 

An embodiment of the present invention works as follows: it accepts a number of 
design parameters fixed by the designer, and then produces the constraints for the 
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unspecified parameters that would allow for an exactly representable remainder. The 
present disclosure will first describe the general flow of the apparatus, and then describe 
the three basic building blocks of the apparatus in detail 

The present invention can use either of two procedures, design by rounding and 
design by quotient, based on the supplied parameters. The process is diagrammed in 
Figure 1 . First, a range of input values for^4 and B is selected 2, which affects the range of 
values for a and b. 

a min < a <a max and b min < b <b max . 

Note the strict upper bound. If A and B are general floating-point numbers, we set 
a min> b min = 1 and a max> b max = 2. In some situations, the divisor may be a fixed constant, in 
which case, we can setb^ = b and b max = b + e . 

Next, select the design method 4 . This is either "design by rounding" where the 
rounding direction and length L are specified 6 and where the designer wishes to obtain 
accuracy requirements on the approximate quotient, or "design by quotient" where the 
accuracy characteristic of the approximate quotient is specified together with a rounding 
direction 8. The apparatus will then produce a list of usable rounding lengths!. If design 
by rounding is selected, obtain rounding direction and rounding length L 6. Otherwise, 
obtain rounding direction and accuracy characteristic of approximate quotient 8. That is, 
obtain amin and a max where: 

W=—(l + a) 
B 
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OC Mi 5s (X ^ (X max . 



If design by rounding is selected, the apparatus will produce accuracy requirement 
for the approximate quotient 10. That is, the apparatus will produce a mm and a max where 
the approximate quotient must satisfy 

— (1+a) 
B 1 7 

tfmin < a < a max. 

otherwise, the apparatus will produce a list of acceptable rounding lengths I. 

The present invention uses the following error analysis, which those of ordinary skill 
in the art will recognize can be used to adapt the present invention to a wide variety of 
applications. 

Defined, B, and Was follows: 

A = o Ax2 m xa 
B = a B x2 n xb 

W= O w x2 k X l.WlW2...W p -i, 

where a w = a ax g b and k = (m - n) + <5. And, impose the restriction that: 

-1 <d<l. 

Then, a sufficient condition for the remainder to berepresentable in the underlying 
^-significant bit floating-point format is: 

\A-QxB\< 2 p hb{QxB-A). 

where *lsb" signifies the least significant bit. 
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Now, \sh(QxB)>2 k ' L+1 x2 n+] ' p 
and \sh(A)>2 m+1 -P. 

Thus, 2 p x lsb(QxB-A)>mm(2 k - L+1 x 2 n+1 , 2 m+1 ). 

0032 Next, we express: 

Q = W+o w x2 k J3 



giving: A-QxB=-<jA(a/A/ +2 k J3x!B/). 



Jip033 Hence, the sufficient condition for a representable remainder is: 

M -mm (2 k ' L+1 x 2* +1 , 2 m+1 ) < al m a+ 2 k $ T b < min (2*" w x 2 n+l , 2 m+1 ). 

N. 

LP034 This condition is exploited by the present invention. For example, given a rounding 

k| method and length p is bounded by specific values of p min and p max . Then as long as: 

in 

9 -min^ L+/ x2 n+/ ,2" J+i ;-^iS2"Z> . 
fa max ( ' — )<a 

2 m a 

. ,rmn(2 k - L + ] x2 n+] ,2 m+1 )-2 k p2 n b, 
and, a<mm( ) 

2 a 

then, the remainder will be representable exactly. 
0035 The present invention uses three building blocks, defined below: 

{fi min, (5 max), 
(S min, 8 max,OK) , 
and (Lo f OK). 



Express Mail No.: EL500380483US 
Docket No.: 2207112617 
29986__5.DOC 



0036 (/3 min, (3 max) -Beta _ Bound (direction, Z). This returns the value of /? min and 

(3 max according to the following table: 



s 



in 



0037 



direction 


j3 min 


f3 max 


nearest 


-1/2 L 


\I2 L 


midpoint 


-2/2 L 


2I2 L 


inwards 


-2/2 1 


0 


outwards 


0 


2I2 L 



( a min, a max, OK )~ Alpha _ Bound ( J3 min, (5 max, L, a max, b max). This computes 
a min and a max based on the formula: 

/ min(2 5+2 - L ) 2) + /3minAmax2 5 ) 



a: mm = mm , 

-1<5<1 



d max 



and 



a mix = nun 

-1<6<1 



min(2 + ,2)-/3max bm^2 



CI max 



Set OK to be true if (Xmin<0,amax>0, and (Xmm <amax, and false otherwise. 
(Lo 9 OK)=L_Bound(amm,anw,awn). Which returns: 
Lo = min (U§, Vs\ 

-\<5<\ 



C/ff = [ - 10g2( | a min | a max/ 2 2 "* )] , 



where 

and ^=[~l0g2(amaxamax/2 Z ^°)]. 

Set OK to be true if a max a max< 2 and | a min | a max < 2, and false otherwise. 
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0038 Figure 2 depicts the procedure to perform the design by rounding in detail. Given 

rounding direction and length 6, invoke the Beta_Bound building block to obtain j3 min and 
/5 max 14. Use the values of {3 mm and (3 max thus obtained and amax,bmax to invoke the 

AlphaJBound building block 16. If OK produced by AlphaJBound is false, the design is 
infeasible for the given input parameters. If OK is true 18, the accuracy requirement on the 
approximate quotient is given by a min and a max produced by the AlphaJBound building 
O block. 

M 

111)039 Figure 3 depicts the procedure to perform the design by quotient in detail. Given 

flfl 

the accuracy bounds 20 on the approximate quotient U =a min < 0, V =a max> 0, rounding 

M 

a direction 22, and a max , invoke the L JBound building block to obtain Lo and OK 24. If OK 



if? 5 



{* 26 produced by LJBound is false, the design is infeasible. If OK 26 is true, then for each 

III 

51 L = Lo, Lo - 1, . . . ,2 then continue. Invoke Beta J3ound based on L and rounding direction 28 . 

Invoke AlphaJBound using the j3 min, j3 max, L, (2 max, and bmax 30. If OK from Alpha_Bound is 
true and the a min andtfmax thus obtained satisfy (Xmin<U and V<amax then this L is 

acceptable 32. Otherwise this L value is unacceptable 34. After examining the 
AlphaJBound to determine whether the current L value is acceptable 32 or unacceptable 34, 
the current L value might be decreased 36 where feasible. 
0040 One standard function in a run-time library of most computer systems is the two- 

argument inverse tangent function atan2 (YJC), where in general we need to compute the 
value of arctan ( Y/X) for floating point inputs Zand 7. Typically, the computational method 
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takes advantage of the simple sign symmetry of the function and considers the case 0 <XJ. 
A fast algorithm based on table lookup technique can be applied to the common case where 
the range of Y/Xis limited, say, to: 

2~ 5 <(7/X)<2 5 . 

For this range, we find a floating-point value Q with a few significant bits such that 
Q~ Y I X and compute arctan( Y/X) via the formula: 

r Y —OX ^ 

arctan(7/X) = arctan(0 + arctan *~ . 

v ^ J 

All the possible values of arctan(0, within the range, are calculated beforehand and 
stored in a table. For accuracy purposes, we would like to compute Y- QX without error. 
One way to obtain Q is to first, compute W ~ Y IX by some method. Wean be expressed 

as: 

On some machines with an efficient bit manipulation instruction, it is convenient to define 

Q as an L significant number of bits by: 

Q = 2 m x I.WJW2W3WL— 2I. 

This technique is the midpoint rounding mechanism. The present invention may be used to 

inquire how much accuracy is needed for representing W, and efficiently create lookup 

tables for its calculation. 

First, input the range of the fractional part of Zand 7 ? which is set at (1,2). Specify 

design by rounding, with midpoint rounding as the choice, and specify L = 5 so that for each 
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m value there are 16 entries of table to store. For the parameters chosen, the present 
invention indicated that the design constraints not met. This signifies that the remainder Y- 
QXmzy not berepresentable in the underlying floating-point format. 
0044 So, we can try again with a different rounding scheme and specify rounding to 

nearest instead. Again, input the range of the fractional part of Zand Y 9 which is (1,2). 
Specify design by rounding with nearest rounding as the choice, and specify L = 5, This 

?J time the parameters result in a requirement of amin = - 0.015625 and a max = 0.015625. 

O 
O 

'In Using a computer with an approximate reciprocal instruction that delivers approximately 12 

U significant bit of accuracy, W is calculated by: 

M W:=Y x approximate-reciprocal(Z) 

}f satisfies the required bounds on the a's. This combination of approximate W calculation, 

IH rounding method, and table size is used by the present invention to create lookup tables. 

3 

I1J0045 Without this method, we might have used the more convenient midpoint rounding 

method, leading to loss of accuracy. The rounding to the nearest method in obtaining Q 

from Wis only very slightly more expensive (on the order of 2 extra cycles), but was 

revealed to be necessary by the present invention. 

0046 On architectures without approximate reciprocation capabilities, one would typically 

create an approximate reciprocal table. One way to create an approximate reciprocal table 

for a floating number X= 1 jc\X2- . .jc m . . .x v - 1 is to use the m significant bits xi% 2 . . -x m to 

serve as an index. Thus, X=l-Hy/2 m + <5, and the table contains the working-precision 

value of Bj = 1/(1 + j / 2 m + 1/2 W + x ). Given X in the above form, we obtain j by extracting 
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the bits xi through x m . Bj would approximate l/Xin the sense: 

\BX-l\<2 <m + l) (\-2 (m + 2) ). 
0047 For a specific value of L in the arctangent algorithm, how large of a reciprocal table 

is sufficient? The present invention can generate the m values, for each specified value of 
Z, that satisfy the bounds on the a' s , and thus produce the reciprocal table and bounds on 
the remainder. That is, the present invention creates tables of reciprocal values that exactly 

l& represent the approximate remainder value within the accuracy specified. 

P 

130048 Figure 4 contains a flow chart of the table creation process. First, the input ranges 

m 

r 1 of A and B are specified 38, along with a preferred rounding direction and accuracy 40. 

f& 

8S 

;q Next, the values of Beta 42 and Alpha 44 are calculated, with Alpha used to determine the 

minimum value of m for the reciprocal table 46. The reciprocal table is then created 48 
based on the accuracy required, preferable for the minimum value of m as well as higher 
values. Lastly, the value of I may, optionally, be incremented 50, and the process repeated 
in order to create additional tables. 
0049 Tables 1 and 2 below are example reciprocal tables for the arctangent function 

created by the present invention. 
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Reciprocal Table forZ = 3 





m = Z 


m = 4 


w = 5 




|7-gX| = 0.3125 


|7- QX\ = 0.28125 


\Y- QX\ = 0.265625 


J 








0 


0.94117647 


0.96969697 


0.98461538 


1 


0.84210526 


0.91428571 


0.95522388 


2 


0.76190476 


0.86486486 


0.92753623 


3 


0.69565217 


0.82051282 


0.90140845 


4 


0.64 


0.7804878 


0.87671233 


5 


0.59259259 


0.74418605 


0.85333333 


6 


0.55172414 


0.71111111 


0.83116883 


7 


0.51612903 


0.68085106 


0.81012658 


8 




0.65306122 


0.79012346 


9 




0.62745098 


0.77108434 


10 




0.60377358 


0.75294118 


11 




0.58181818 


0.73563218 


12 




0.56140351 


0.71910112 


13 




0.54237288 


0.7032967 


14 




0.52459016 


0.68817204 


15 




0.50793651 


0.67368421 


16 






0.65979381 


17 






0.64646465 


18 






0.63366337 


19 






0.62135922 


20 






0.60952381 


21 






0.59813084 


22 






0.58715596 


23 






0.57657658 


24 






0.56637168 


25 






0.55652174 


26 






0.54700855 


27 






0.53781513 


28 






0.52892562 


29 






0.5203252 


30 






0.512 


31 






0.50393701 



TABLE 1 



Reciprocal Table for L = 4 
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m = 4 
\Y- QX\ = 0.15625 


m = 5 
\Y- 0X| = O.14O625 


m = 6 
\Y-QX\ = 0.1328125 


J 


5/ 




Bj 


0 


0.96969697 


0.98461538 


0.99224806 


1 


0.91428571 


0.95522388 


0.97709924 


2 


0.86486486 


0.92753623 


0.96240602 


3 


0.82051282 


0.90140845 


0.94814815 


4 


0.7804878 


0.87671233 


0.93430657 


5 


0.74418605 


0.85333333 


0.92086331 


6 


0.71111111 


0.83116883 


0.90780142 


7 


0.68085106 


0.81012658 


0.8951049 ! 


8 


0.65306122 


0.79012346 


0.88275862 


9 


0.62745098 


0.77108434 


0.8707483 


10 


0.60377358 


0.75294118 


0.8590604 1 


11 


0.58181818 


0.73563218 


0.84768212 


12 


0.56140351 


0.71910112 


0.83660131 


13 


0.54237288 


0.7032967 


0.82580645 


14 


0.52459016 


0.68817204 


0.81528662 


15 


0.50793651 


0.67368421 


0.80503145 


16 




0.65979381 


0.79503106 


17 




0.64646465 


0.78527607 


18 




0.63366337 


0.77575758 


19 




0.62135922 


0.76646707 


20 




0.60952381 


0.75739645 


21 




0.59813084 


0.74853801 


22 




0.58715596 


0.73988439 


23 




0.57657658 


0.73142857 


24 




0.56637168 


0.72316384 


25 




0.55652174 


0.7150838 


26 




0.54700855 


0.70718232 


27 




0.53781513 


0.69945355 


28 




0.52892562 


0.69189189 


29 




0.5203252 


0.68449198 


30 




0.512 


0.67724868 


31 




0.50393701 


0.67015707 


32 






0.66321244 


33 






0.65641026 


34 






0.64974619 


35 






0.64321608 


36 






0.63681592 
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37 






0.63054187 


38 






0.62439024 1 


39 






0.61835749 


40 






0.61244019 


41 






0.60663507 


42 






0.60093897 


43 






0.59534884 J 


44 






0.58986175 


45 






0.58447489 


46 






0.57918552 


47 






0.57399103 


48 






0.56888889 


49 






0.56387665 


50 






0.55895197 


51 






0.55411255 


59 






0.54935622 


53 






0.54468085 


54 






0.54008439 


55 






0.53556485 


56 






0.53112033 


57 






0.52674897 


58 






0.52244898 


59 






0. 51821862 


60 






0.51405622 


61 






0.50996016 


62 






0.50592885 


63 






0.50196078 



The above arctangent example is merely one of the many lookup tables that may be 
created by the present invention, which is not intended to be limited to use with arctangents 
or any other particular function. Those of ordinary skill in the art will recognize, with the 
benefit of the present disclosure, the adjustments that must be made to adapt the present 
invention to create lookup tables of a given accuracy for other functions. Embodiments of 
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the present invention may also be used to create pairs of tables, in order to provide even 
greater flexibility. The example below demonstrates such a two table approach to 
computing the axctm(Y/X) which exploits use of Qjp twice. 

First, a reciprocal table (first table) is used to obtain B, where B ~ — and W is 

X 

computed, using: 

W^YxB 

Rounding is used to obtain a suitable Q^ t where: 
Next, the floating-point remainder is obtained. 

Finally, using a second table for the term arctan(g^), arctan(7ZY) is computed as: 

arctan(y/X) = arctan (Qjp) + arctan (Z) 
where Zis computed based on Rfr. The size of both tables are based on the accuracy of B 
and Z, and the magnitude of Z. 

Those of ordinary skill in the art will recognize, in light of the present disclosure, 
that the above "multiple table" approach can easily be extended to utilize any number of 
tables, and the present invention is not intended to be limited to use with one, two, or any 
particular number of look-up tables. In this way, embodiments of the present invention 
allow a very wide number of design choices which may be tailored to the constraints 
encountered. 
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The present invention is also not intended to be limited to creating tables of a given 
accuracy for any particular radix. That is, the tables could be created in binary, octal, 
decimal, hexadecimal, or any other system, so long as the distance between the most 
significant digit (msb) and least significant digit (lsb) is with the required precision when 
represented in floating-point format. 

Although the above disclosure provides various embodiments and examples of the 
present invention for the purposes of illustration, these embodiments and examples are not 
intended to be an exhaustive list of all possible implementations of the present invention 
and should not be construed in limiting the present invention. Those of ordinary skill in the 
art should recognize, with the benefit of the present disclosure, that the present invention 
may be practiced with many modifications and variations to the specific details of the 
present disclosure. For example, embodiments of the present invention might be described 
as being implemented in hardware, while those of ordinary skill in the art will recognize the 
possibility of a software implementation. In such cases, the present invention is intended to 
cover hardware and software implementations, and combinations of the two. Similarly, not 
all the specific details, well-known structures, devices, and techniques that are known to 
those of ordinary skill in the art have been shown in order to avoid observing the present 
invention. The present invention is, however, intended to cover a broad range of techniques, 
devices, and well-known structures. The invention, therefore, is intended to be limited in 
scope only by the purview of the appended claims. 
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